Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 1780 |
| Missing cells | 8883 |
| Missing cells (%) | 31.2% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 212.2 KiB |
| Average record size in memory | 122.1 B |
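The overview figures can be cross-checked with a little arithmetic. The sketch below is not part of the profiling tool. It assumes that "Missing cells (%)" is the missing count divided by variables × observations, and that the average record size is total memory divided by the number of observations. The per-variable counts are copied from the missing-value warnings in this report.

```python
# Per-variable missing counts, copied from the "Missing" warnings in this report.
missing_per_variable = {
    "yield_pigs_hgperhd": 40,
    "production_pigs_tonnes": 29,
    "slaughtered_pigs_hd": 20,
    "stocks_pigs_hd": 40,
    "producerprice_pigs_carcass_lcupertonne": 1373,
    "producerprice_pigs_live_lcupertonne": 1136,
    "producerprice_pigs_carcass_slcpertonne": 1373,
    "producerprice_pigs_live_slcpertonne": 1136,
    "producerprice_pigs_carcass_usdpertonne": 1373,
    "producerprice_pigs_live_usdpertonne": 1143,
    "producerprice_pigs_carcass_index": 619,
    "producerprice_pigs_live_index": 601,
}

n_variables = 16
n_observations = 1780

# Assumption: the percentage is taken over every cell in the table.
total_cells = n_variables * n_observations
missing_cells = sum(missing_per_variable.values())
missing_pct = 100 * missing_cells / total_cells

# Assumption: average record size = total memory / number of rows.
total_memory_bytes = 212.2 * 1024  # 212.2 KiB
avg_record_bytes = total_memory_bytes / n_observations

print(f"missing cells: {missing_cells} ({missing_pct:.1f}%)")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the per-variable counts add up to the reported 8883 missing cells.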
Variable types
| NUM | 13 |
|---|---|
| CAT | 2 |
| BOOL | 1 |
Warnings
| country has a high cardinality: 178 distinct values | High cardinality |
|---|---|
| slaughtered_pigs_hd is highly correlated with production_pigs_tonnes and 1 other fields | High correlation |
| production_pigs_tonnes is highly correlated with slaughtered_pigs_hd and 1 other fields | High correlation |
| stocks_pigs_hd is highly correlated with production_pigs_tonnes and 1 other fields | High correlation |
| producerprice_pigs_live_lcupertonne is highly correlated with producerprice_pigs_carcass_lcupertonne and 2 other fields | High correlation |
| producerprice_pigs_carcass_lcupertonne is highly correlated with producerprice_pigs_live_lcupertonne and 2 other fields | High correlation |
| producerprice_pigs_carcass_slcpertonne is highly correlated with producerprice_pigs_carcass_lcupertonne and 2 other fields | High correlation |
| producerprice_pigs_live_slcpertonne is highly correlated with producerprice_pigs_carcass_lcupertonne and 2 other fields | High correlation |
| yield_pigs_hgperhd has 40 (2.2%) missing values | Missing |
| production_pigs_tonnes has 29 (1.6%) missing values | Missing |
| slaughtered_pigs_hd has 20 (1.1%) missing values | Missing |
| stocks_pigs_hd has 40 (2.2%) missing values | Missing |
| producerprice_pigs_carcass_lcupertonne has 1373 (77.1%) missing values | Missing |
| producerprice_pigs_live_lcupertonne has 1136 (63.8%) missing values | Missing |
| producerprice_pigs_carcass_slcpertonne has 1373 (77.1%) missing values | Missing |
| producerprice_pigs_live_slcpertonne has 1136 (63.8%) missing values | Missing |
| producerprice_pigs_carcass_usdpertonne has 1373 (77.1%) missing values | Missing |
| producerprice_pigs_live_usdpertonne has 1143 (64.2%) missing values | Missing |
| producerprice_pigs_carcass_index has 619 (34.8%) missing values | Missing |
| producerprice_pigs_live_index has 601 (33.8%) missing values | Missing |
| country is uniformly distributed | Uniform |
Reproduction
| Analysis started | 2022-04-08 16:39:18.206035 |
|---|---|
| Analysis finished | 2022-04-08 16:39:48.156390 |
| Duration | 29.95 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
country
Categorical
| Distinct | 178 |
|---|---|
| Distinct (%) | 10.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 13.9 KiB |
| Value | Count | Frequency (%) | |
| Poland | 10 | 0.6% | |
| Kenya | 10 | 0.6% | |
| United Kingdom of Great Britain and Northern Ireland | 10 | 0.6% | |
| Botswana | 10 | 0.6% | |
| United States of America | 10 | 0.6% | |
| Azerbaijan | 10 | 0.6% | |
| New Caledonia | 10 | 0.6% | |
| Cook Islands | 10 | 0.6% | |
| Lao People's Democratic Republic | 10 | 0.6% | |
| China, Taiwan Province of | 10 | 0.6% | |
| Other values (168) | 1680 | 94.4% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 52 |
|---|---|
| Median length | 8 |
| Mean length | 10.34269663 |
| Min length | 4 |
year
Real number (ℝ≥0)
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2015.5 |
|---|---|
| Minimum | 2011 |
| Maximum | 2020 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 13.9 KiB |
Quantile statistics
| Minimum | 2011 |
|---|---|
| 5-th percentile | 2011 |
| Q1 | 2013 |
| median | 2015.5 |
| Q3 | 2018 |
| 95-th percentile | 2020 |
| Maximum | 2020 |
| Range | 9 |
| Interquartile range (IQR) | 5 |
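The quantile table for year can be reproduced from the value counts alone, since each of the ten years occurs 178 times. The sketch below assumes linear interpolation between order statistics; the report does not state which method it uses, and `quantile` is a helper written for this illustration.

```python
import math


def quantile(sorted_values, q):
    """Quantile with linear interpolation between neighbouring order statistics."""
    pos = q * (len(sorted_values) - 1)
    lower = math.floor(pos)
    upper = math.ceil(pos)
    frac = pos - lower
    return sorted_values[lower] + frac * (sorted_values[upper] - sorted_values[lower])


# Rebuild the year column from the counts in this report: 2011..2020, 178 rows each.
years = sorted(year for year in range(2011, 2021) for _ in range(178))

p5 = quantile(years, 0.05)
q1 = quantile(years, 0.25)
median = quantile(years, 0.50)
q3 = quantile(years, 0.75)
p95 = quantile(years, 0.95)
iqr = q3 - q1
value_range = max(years) - min(years)

print(p5, q1, median, q3, p95, iqr, value_range)
```

The median of 2015.5 arises because the two middle observations fall on 2015 and 2016.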
Descriptive statistics
| Standard deviation | 2.873088484 |
|---|---|
| Coefficient of variation (CV) | 0.001425496643 |
| Kurtosis | -1.224309899 |
| Mean | 2015.5 |
| Median Absolute Deviation (MAD) | 2.5 |
| Skewness | 0 |
| Sum | 3587590 |
| Variance | 8.254637437 |
| Monotonicity | Not monotonic |
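Several of the descriptive statistics for year can be recomputed with the Python standard library. This is a sketch under stated assumptions: the variance is the sample variance (divisor n − 1), the coefficient of variation is standard deviation divided by mean, and MAD is the median of absolute deviations from the median. The column is rebuilt from the value counts in this report.

```python
import statistics

# Rebuild the year column: 2011..2020, 178 rows each.
years = [float(year) for year in range(2011, 2021) for _ in range(178)]

mean = statistics.fmean(years)
variance = statistics.variance(years)  # sample variance, divisor n - 1
std = statistics.stdev(years)          # square root of the sample variance
cv = std / mean                        # coefficient of variation
median = statistics.median(years)
mad = statistics.median(abs(y - median) for y in years)  # median absolute deviation
total = sum(years)

print(mean, variance, std, cv, mad, total)
```

With these definitions the results agree with the table to the printed precision, which suggests the report uses the n − 1 divisor.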
Histogram with fixed size bins (bins=10)
| Value | Count | Frequency (%) | |
| 2011 | 178 | 10.0% | |
| 2012 | 178 | 10.0% | |
| 2013 | 178 | 10.0% | |
| 2014 | 178 | 10.0% | |
| 2015 | 178 | 10.0% | |
| 2016 | 178 | 10.0% | |
| 2017 | 178 | 10.0% | |
| 2018 | 178 | 10.0% | |
| 2019 | 178 | 10.0% | |
| 2020 | 178 | 10.0% |
| Value | Count | Frequency (%) | |
| 2011 | 178 | 10.0% | |
| 2012 | 178 | 10.0% | |
| 2013 | 178 | 10.0% | |
| 2014 | 178 | 10.0% | |
| 2015 | 178 | 10.0% |
| Value | Count | Frequency (%) | |
| 2020 | 178 | 10.0% | |
| 2019 | 178 | 10.0% | |
| 2018 | 178 | 10.0% | |
| 2017 | 178 | 10.0% | |
| 2016 | 178 | 10.0% |
yield_pigs_hgperhd
Real number (ℝ≥0)
| Distinct | 623 |
|---|---|
| Distinct (%) | 35.8% |
| Missing | 40 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 655.6821839 |
|---|---|
| Minimum | 155 |
| Maximum | 1652 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 13.9 KiB |
Quantile statistics
| Minimum | 155 |
|---|---|
| 5-th percentile | 300 |
| Q1 | 450 |
| median | 650 |
| Q3 | 823.25 |
| 95-th percentile | 1016.15 |
| Maximum | 1652 |
| Range | 1497 |
| Interquartile range (IQR) | 373.25 |
Descriptive statistics
| Standard deviation | 267.374801 |
|---|---|
| Coefficient of variation (CV) | 0.4077810982 |
| Kurtosis | 1.913400083 |
| Mean | 655.6821839 |
| Median Absolute Deviation (MAD) | 184 |
| Skewness | 0.9748975742 |
| Sum | 1140887 |
| Variance | 71489.28421 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 500 | 74 | 4.2% | |
| 400 | 69 | 3.9% | |
| 300 | 66 | 3.7% | |
| 450 | 37 | 2.1% | |
| 600 | 31 | 1.7% | |
| 550 | 25 | 1.4% | |
| 1650 | 20 | 1.1% | |
| 700 | 19 | 1.1% | |
| 350 | 18 | 1.0% | |
| 740 | 16 | 0.9% | |
| Other values (613) | 1365 | 76.7% | |
| (Missing) | 40 | 2.2% |
| Value | Count | Frequency (%) | |
| 155 | 1 | 0.1% | |
| 171 | 1 | 0.1% | |
| 175 | 1 | 0.1% | |
| 190 | 1 | 0.1% | |
| 199 | 2 | 0.1% |
| Value | Count | Frequency (%) | |
| 1652 | 2 | 0.1% | |
| 1651 | 1 | 0.1% | |
| 1650 | 20 | 1.1% | |
| 1649 | 4 | 0.2% | |
| 1647 | 1 | 0.1% |
production_pigs_tonnes
Real number (ℝ≥0)
| Distinct | 1609 |
|---|---|
| Distinct (%) | 91.9% |
| Missing | 29 |
| Missing (%) | 1.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 959594.7938 |
|---|---|
| Minimum | 0 |
| Maximum | 57661871 |
| Zeros | 11 |
| Zeros (%) | 0.6% |
| Memory size | 13.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 114.5 |
| Q1 | 1922.5 |
| median | 18709 |
| Q3 | 217490 |
| 95-th percentile | 2155052.5 |
| Maximum | 57661871 |
| Range | 57661871 |
| Interquartile range (IQR) | 215567.5 |
Descriptive statistics
| Standard deviation | 5667773.922 |
|---|---|
| Coefficient of variation (CV) | 5.90642421 |
| Kurtosis | 79.09455491 |
| Mean | 959594.7938 |
| Median Absolute Deviation (MAD) | 18510 |
| Skewness | 8.824259394 |
| Sum | 1680250484 |
| Variance | 3.212366123e+13 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 11 | 0.6% | |
| 19 | 9 | 0.5% | |
| 66 | 5 | 0.3% | |
| 420 | 5 | 0.3% | |
| 67 | 4 | 0.2% | |
| 76 | 4 | 0.2% | |
| 120000 | 4 | 0.2% | |
| 75 | 4 | 0.2% | |
| 117 | 4 | 0.2% | |
| 555 | 4 | 0.2% | |
| Other values (1599) | 1697 | 95.3% | |
| (Missing) | 29 | 1.6% |
| Value | Count | Frequency (%) | |
| 0 | 11 | 0.6% | |
| 19 | 9 | 0.5% | |
| 20 | 1 | 0.1% | |
| 31 | 1 | 0.1% | |
| 32 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 57661871 | 1 | 0.1% | |
| 57415740 | 1 | 0.1% | |
| 56713900 | 1 | 0.1% | |
| 56454000 | 1 | 0.1% | |
| 55917319 | 1 | 0.1% |
slaughtered_pigs_hd
Real number (ℝ≥0)
| Distinct | 1672 |
|---|---|
| Distinct (%) | 95.0% |
| Missing | 20 |
| Missing (%) | 1.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12217652.85 |
|---|---|
| Minimum | 0 |
| Maximum | 744917877 |
| Zeros | 17 |
| Zeros (%) | 1.0% |
| Memory size | 13.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2499.5 |
| Q1 | 33170.5 |
| median | 355303.5 |
| Q3 | 2622370.5 |
| 95-th percentile | 25007783 |
| Maximum | 744917877 |
| Range | 744917877 |
| Interquartile range (IQR) | 2589200 |
Descriptive statistics
| Standard deviation | 74272196.96 |
|---|---|
| Coefficient of variation (CV) | 6.07908883 |
| Kurtosis | 80.15663181 |
| Mean | 12217652.85 |
| Median Absolute Deviation (MAD) | 350071 |
| Skewness | 8.934247075 |
| Sum | 2.150306901e+10 |
| Variance | 5.516359241e+15 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 0 | 17 | 1.0% | |
| 18500 | 5 | 0.3% | |
| 648 | 5 | 0.3% | |
| 14000 | 4 | 0.2% | |
| 2500 | 4 | 0.2% | |
| 1500000 | 4 | 0.2% | |
| 360000 | 3 | 0.2% | |
| 550000 | 3 | 0.2% | |
| 13000 | 3 | 0.2% | |
| 25000 | 3 | 0.2% | |
| Other values (1662) | 1709 | 96.0% | |
| (Missing) | 20 | 1.1% |
| Value | Count | Frequency (%) | |
| 0 | 17 | 1.0% | |
| 76 | 1 | 0.1% | |
| 85 | 1 | 0.1% | |
| 87 | 1 | 0.1% | |
| 645 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 744917877 | 1 | 0.1% | |
| 735104000 | 1 | 0.1% | |
| 734077183 | 1 | 0.1% | |
| 726039838 | 1 | 0.1% | |
| 724156000 | 1 | 0.1% |
stocks_pigs_hd
Real number (ℝ≥0)
| Distinct | 1611 |
|---|---|
| Distinct (%) | 92.6% |
| Missing | 40 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8127653.168 |
|---|---|
| Minimum | 10 |
| Maximum | 486742946 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 13.9 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 3579.05 |
| Q1 | 37046.25 |
| median | 433659.5 |
| Q3 | 2443146 |
| 95-th percentile | 17210648.55 |
| Maximum | 486742946 |
| Range | 486742936 |
| Interquartile range (IQR) | 2406099.75 |
Descriptive statistics
| Standard deviation | 48092039.58 |
|---|---|
| Coefficient of variation (CV) | 5.917088068 |
| Kurtosis | 81.40257339 |
| Mean | 8127653.168 |
| Median Absolute Deviation (MAD) | 427923 |
| Skewness | 8.990088586 |
| Sum | 1.414211651e+10 |
| Variance | 2.312844271e+15 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 5000 | 16 | 0.9% | |
| 1000 | 10 | 0.6% | |
| 3000 | 9 | 0.5% | |
| 11 | 6 | 0.3% | |
| 5500 | 5 | 0.3% | |
| 35000 | 5 | 0.3% | |
| 30000 | 5 | 0.3% | |
| 50000 | 4 | 0.2% | |
| 32200 | 4 | 0.2% | |
| 8000 | 4 | 0.2% | |
| Other values (1601) | 1672 | 93.9% | |
| (Missing) | 40 | 2.2% |
| Value | Count | Frequency (%) | |
| 10 | 1 | 0.1% | |
| 11 | 6 | 0.3% | |
| 12 | 1 | 0.1% | |
| 13 | 1 | 0.1% | |
| 14 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 486742946 | 1 | 0.1% | |
| 485112117 | 1 | 0.1% | |
| 484914637 | 1 | 0.1% | |
| 480302400 | 1 | 0.1% | |
| 478931400 | 2 | 0.1% |
country_inscope
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.7 KiB |
| Value | Count | Frequency (%) | |
| False | 1650 | 92.7% | |
| True | 130 | 7.3% |
producerprice_pigs_carcass_lcupertonne
Real number (ℝ≥0)
| Distinct | 377 |
|---|---|
| Distinct (%) | 92.6% |
| Missing | 1373 |
| Missing (%) | 77.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2030616.16 |
|---|---|
| Minimum | 1139 |
| Maximum | 46711000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 13.9 KiB |
Quantile statistics
| Minimum | 1139 |
|---|---|
| 5-th percentile | 1404.6 |
| Q1 | 2074.5 |
| median | 17621 |
| Q3 | 183015 |
| 95-th percentile | 8048333.3 |
| Maximum | 46711000 |
| Range | 46709861 |
| Interquartile range (IQR) | 180940.5 |
Descriptive statistics
| Standard deviation | 7894335.045 |
|---|---|
| Coefficient of variation (CV) | 3.88765499 |
| Kurtosis | 21.40139092 |
| Mean | 2030616.16 |
| Median Absolute Deviation (MAD) | 16121 |
| Skewness | 4.687553628 |
| Sum | 826460777 |
| Variance | 6.23205258e+13 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 17621 | 4 | 0.2% | |
| 6480 | 4 | 0.2% | |
| 17637 | 3 | 0.2% | |
| 435400 | 3 | 0.2% | |
| 2750000 | 3 | 0.2% | |
| 1520 | 3 | 0.2% | |
| 476030 | 2 | 0.1% | |
| 4500 | 2 | 0.1% | |
| 9500 | 2 | 0.1% | |
| 1584 | 2 | 0.1% | |
| Other values (367) | 379 | 21.3% | |
| (Missing) | 1373 | 77.1% |
| Value | Count | Frequency (%) | |
| 1139 | 1 | 0.1% | |
| 1140 | 1 | 0.1% | |
| 1144 | 1 | 0.1% | |
| 1232 | 1 | 0.1% | |
| 1245 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 46711000 | 1 | 0.1% | |
| 46390000 | 1 | 0.1% | |
| 46263000 | 1 | 0.1% | |
| 46170000 | 1 | 0.1% | |
| 46075634 | 1 | 0.1% |
producerprice_pigs_live_lcupertonne
Real number (ℝ≥0)
| Distinct | 604 |
|---|---|
| Distinct (%) | 93.8% |
| Missing | 1136 |
| Missing (%) | 63.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 985987.0342 |
|---|---|
| Minimum | 829 |
| Maximum | 34651034 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 13.9 KiB |
Quantile statistics
| Minimum | 829 |
|---|---|
| 5-th percentile | 1104.45 |
| Q1 | 2577.5 |
| median | 14643.5 |
| Q3 | 132615.25 |
| 95-th percentile | 3552622.95 |
| Maximum | 34651034 |
| Range | 34650205 |
| Interquartile range (IQR) | 130037.75 |
Descriptive statistics
| Standard deviation | 4374364.699 |
|---|---|
| Coefficient of variation (CV) | 4.436533694 |
| Kurtosis | 34.19726852 |
| Mean | 985987.0342 |
| Median Absolute Deviation (MAD) | 13430.5 |
| Skewness | 5.814714403 |
| Sum | 634975650 |
| Variance | 1.913506652e+13 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 13228 | 12 | 0.7% | |
| 6801 | 7 | 0.4% | |
| 5040 | 4 | 0.2% | |
| 11000 | 4 | 0.2% | |
| 1928571 | 3 | 0.2% | |
| 21753 | 3 | 0.2% | |
| 1212 | 3 | 0.2% | |
| 1034 | 2 | 0.1% | |
| 1424 | 2 | 0.1% | |
| 6418 | 2 | 0.1% | |
| Other values (594) | 602 | 33.8% | |
| (Missing) | 1136 | 63.8% |
| Value | Count | Frequency (%) | |
| 829 | 1 | 0.1% | |
| 893 | 1 | 0.1% | |
| 912 | 1 | 0.1% | |
| 936 | 1 | 0.1% | |
| 957 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 34651034 | 1 | 0.1% | |
| 32688916 | 1 | 0.1% | |
| 31529793 | 1 | 0.1% | |
| 30757386 | 2 | 0.1% | |
| 30100000 | 1 | 0.1% |
producerprice_pigs_carcass_slcpertonne
Real number (ℝ≥0)
| Distinct | 377 |
|---|---|
| Distinct (%) | 92.6% |
| Missing | 1373 |
| Missing (%) | 77.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1907944.464 |
|---|---|
| Minimum | 1139 |
| Maximum | 46711000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 13.9 KiB |
Quantile statistics
| Minimum | 1139 |
|---|---|
| 5-th percentile | 1404.6 |
| Q1 | 2074.5 |
| median | 17621 |
| Q3 | 176108.5 |
| 95-th percentile | 6479928.8 |
| Maximum | 46711000 |
| Range | 46709861 |
| Interquartile range (IQR) | 174034 |
Descriptive statistics
| Standard deviation | 7728339.622 |
|---|---|
| Coefficient of variation (CV) | 4.050610365 |
| Kurtosis | 23.49112978 |
| Mean | 1907944.464 |
| Median Absolute Deviation (MAD) | 16113 |
| Skewness | 4.913892725 |
| Sum | 776533397 |
| Variance | 5.972723332e+13 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 17621 | 4 | 0.2% | |
| 6480 | 4 | 0.2% | |
| 17637 | 3 | 0.2% | |
| 2750000 | 3 | 0.2% | |
| 1520 | 3 | 0.2% | |
| 435400 | 3 | 0.2% | |
| 2600 | 2 | 0.1% | |
| 9500 | 2 | 0.1% | |
| 15432 | 2 | 0.1% | |
| 1584 | 2 | 0.1% | |
| Other values (367) | 379 | 21.3% | |
| (Missing) | 1373 | 77.1% |
| Value | Count | Frequency (%) | |
| 1139 | 1 | 0.1% | |
| 1140 | 1 | 0.1% | |
| 1144 | 1 | 0.1% | |
| 1232 | 1 | 0.1% | |
| 1245 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 46711000 | 1 | 0.1% | |
| 46390000 | 1 | 0.1% | |
| 46263000 | 1 | 0.1% | |
| 46170000 | 1 | 0.1% | |
| 46075634 | 1 | 0.1% |
producerprice_pigs_live_slcpertonne
Real number (ℝ≥0)
| Distinct | 601 |
|---|---|
| Distinct (%) | 93.3% |
| Missing | 1136 |
| Missing (%) | 63.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 812224.1506 |
|---|---|
| Minimum | 829 |
| Maximum | 34651034 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 13.9 KiB |
Quantile statistics
| Minimum | 829 |
|---|---|
| 5-th percentile | 1120.75 |
| Q1 | 2351.5 |
| median | 13228 |
| Q3 | 125423.75 |
| 95-th percentile | 2209493.8 |
| Maximum | 34651034 |
| Range | 34650205 |
| Interquartile range (IQR) | 123072.25 |
Descriptive statistics
| Standard deviation | 3992165.526 |
|---|---|
| Coefficient of variation (CV) | 4.915103205 |
| Kurtosis | 46.45464012 |
| Mean | 812224.1506 |
| Median Absolute Deviation (MAD) | 12046.5 |
| Skewness | 6.765222472 |
| Sum | 523072353 |
| Variance | 1.593738559e+13 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 13228 | 12 | 0.7% | |
| 6801 | 7 | 0.4% | |
| 11000 | 4 | 0.2% | |
| 2486 | 4 | 0.2% | |
| 5040 | 4 | 0.2% | |
| 1928571 | 3 | 0.2% | |
| 1212 | 3 | 0.2% | |
| 1034 | 2 | 0.1% | |
| 8075 | 2 | 0.1% | |
| 6418 | 2 | 0.1% | |
| Other values (591) | 601 | 33.8% | |
| (Missing) | 1136 | 63.8% |
| Value | Count | Frequency (%) | |
| 829 | 1 | 0.1% | |
| 936 | 1 | 0.1% | |
| 957 | 1 | 0.1% | |
| 987 | 1 | 0.1% | |
| 1013 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 34651034 | 1 | 0.1% | |
| 32688916 | 1 | 0.1% | |
| 31529793 | 1 | 0.1% | |
| 30757386 | 2 | 0.1% | |
| 30100000 | 1 | 0.1% |
producerprice_pigs_carcass_usdpertonne
Real number (ℝ≥0)
| Distinct | 381 |
|---|---|
| Distinct (%) | 93.6% |
| Missing | 1373 |
| Missing (%) | 77.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2822.614251 |
|---|---|
| Minimum | 1236 |
| Maximum | 6957 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 13.9 KiB |
Quantile statistics
| Minimum | 1236 |
|---|---|
| 5-th percentile | 1534.6 |
| Q1 | 1982.5 |
| median | 2454 |
| Q3 | 3280.5 |
| 95-th percentile | 5673.2 |
| Maximum | 6957 |
| Range | 5721 |
| Interquartile range (IQR) | 1298 |
Descriptive statistics
| Standard deviation | 1226.677467 |
|---|---|
| Coefficient of variation (CV) | 0.4345891283 |
| Kurtosis | 1.438180284 |
| Mean | 2822.614251 |
| Median Absolute Deviation (MAD) | 579 |
| Skewness | 1.408815714 |
| Sum | 1148804 |
| Variance | 1504737.607 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 2544 | 4 | 0.2% | |
| 6526 | 4 | 0.2% | |
| 3160 | 3 | 0.2% | |
| 6532 | 3 | 0.2% | |
| 1835 | 2 | 0.1% | |
| 1952 | 2 | 0.1% | |
| 1669 | 2 | 0.1% | |
| 3091 | 2 | 0.1% | |
| 1707 | 2 | 0.1% | |
| 2785 | 2 | 0.1% | |
| Other values (371) | 381 | 21.4% | |
| (Missing) | 1373 | 77.1% |
| Value | Count | Frequency (%) | |
| 1236 | 1 | 0.1% | |
| 1261 | 1 | 0.1% | |
| 1263 | 1 | 0.1% | |
| 1336 | 1 | 0.1% | |
| 1351 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 6957 | 1 | 0.1% | |
| 6840 | 1 | 0.1% | |
| 6532 | 3 | 0.2% | |
| 6526 | 4 | 0.2% | |
| 6461 | 1 | 0.1% |
producerprice_pigs_live_usdpertonne
Real number (ℝ≥0)
| Distinct | 542 |
|---|---|
| Distinct (%) | 85.1% |
| Missing | 1143 |
| Missing (%) | 64.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2060.464678 |
|---|---|
| Minimum | 414 |
| Maximum | 7174 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 13.9 KiB |
Quantile statistics
| Minimum | 414 |
|---|---|
| 5-th percentile | 959.2 |
| Q1 | 1382 |
| median | 1730 |
| Q3 | 2451 |
| 95-th percentile | 4535.8 |
| Maximum | 7174 |
| Range | 6760 |
| Interquartile range (IQR) | 1069 |
Descriptive statistics
| Standard deviation | 1054.519757 |
|---|---|
| Coefficient of variation (CV) | 0.51178735 |
| Kurtosis | 3.403649037 |
| Mean | 2060.464678 |
| Median Absolute Deviation (MAD) | 460 |
| Skewness | 1.690291495 |
| Sum | 1312516 |
| Variance | 1112011.919 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 4899 | 12 | 0.7% | |
| 1361 | 4 | 0.2% | |
| 2486 | 4 | 0.2% | |
| 4074 | 4 | 0.2% | |
| 1849 | 3 | 0.2% | |
| 1143 | 3 | 0.2% | |
| 1721 | 3 | 0.2% | |
| 1363 | 3 | 0.2% | |
| 1317 | 2 | 0.1% | |
| 1691 | 2 | 0.1% | |
| Other values (532) | 597 | 33.5% | |
| (Missing) | 1143 | 64.2% |
| Value | Count | Frequency (%) | |
| 414 | 1 | 0.1% | |
| 463 | 1 | 0.1% | |
| 473 | 1 | 0.1% | |
| 507 | 1 | 0.1% | |
| 544 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 7174 | 1 | 0.1% | |
| 6881 | 1 | 0.1% | |
| 6804 | 1 | 0.1% | |
| 6322 | 1 | 0.1% | |
| 5537 | 1 | 0.1% |
producerprice_pigs_carcass_index
Real number (ℝ≥0)
| Distinct | 114 |
|---|---|
| Distinct (%) | 9.8% |
| Missing | 619 |
| Missing (%) | 34.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 102.464255 |
|---|---|
| Minimum | 29 |
| Maximum | 370 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 13.9 KiB |
Quantile statistics
| Minimum | 29 |
|---|---|
| 5-th percentile | 75 |
| Q1 | 93 |
| median | 100 |
| Q3 | 109 |
| 95-th percentile | 132 |
| Maximum | 370 |
| Range | 341 |
| Interquartile range (IQR) | 16 |
Descriptive statistics
| Standard deviation | 23.13451606 |
|---|---|
| Coefficient of variation (CV) | 0.2257813329 |
| Kurtosis | 46.91687997 |
| Mean | 102.464255 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 4.759520432 |
| Sum | 118961 |
| Variance | 535.2058333 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 100 | 58 | 3.3% | |
| 98 | 45 | 2.5% | |
| 99 | 44 | 2.5% | |
| 101 | 42 | 2.4% | |
| 95 | 41 | 2.3% | |
| 94 | 41 | 2.3% | |
| 96 | 37 | 2.1% | |
| 106 | 37 | 2.1% | |
| 104 | 36 | 2.0% | |
| 108 | 35 | 2.0% | |
| Other values (104) | 745 | 41.9% | |
| (Missing) | 619 | 34.8% |
| Value | Count | Frequency (%) | |
| 29 | 1 | 0.1% | |
| 31 | 1 | 0.1% | |
| 32 | 1 | 0.1% | |
| 36 | 2 | 0.1% | |
| 48 | 1 | 0.1% |
| Value | Count | Frequency (%) | |
| 370 | 1 | 0.1% | |
| 360 | 1 | 0.1% | |
| 345 | 1 | 0.1% | |
| 327 | 1 | 0.1% | |
| 195 | 1 | 0.1% |
producerprice_pigs_live_index
Real number (ℝ≥0)
| Distinct | 124 |
|---|---|
| Distinct (%) | 10.5% |
| Missing | 601 |
| Missing (%) | 33.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 102.8481764 |
|---|---|
| Minimum | 10 |
| Maximum | 522 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 13.9 KiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 74 |
| Q1 | 93 |
| median | 100 |
| Q3 | 108 |
| 95-th percentile | 129.1 |
| Maximum | 522 |
| Range | 512 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 27.4047482 |
|---|---|
| Coefficient of variation (CV) | 0.2664582801 |
| Kurtosis | 63.74820782 |
| Mean | 102.8481764 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 5.586763725 |
| Sum | 121258 |
| Variance | 751.0202238 |
| Monotonicity | Not monotonic |
Histogram with fixed size bins (bins=50)
| Value | Count | Frequency (%) | |
| 100 | 69 | 3.9% | |
| 105 | 45 | 2.5% | |
| 101 | 45 | 2.5% | |
| 98 | 44 | 2.5% | |
| 99 | 43 | 2.4% | |
| 97 | 43 | 2.4% | |
| 94 | 37 | 2.1% | |
| 96 | 36 | 2.0% | |
| 102 | 36 | 2.0% | |
| 104 | 34 | 1.9% | |
| Other values (114) | 747 | 42.0% | |
| (Missing) | 601 | 33.8% |
| Value | Count | Frequency (%) | |
| 10 | 1 | 0.1% | |
| 17 | 2 | 0.1% | |
| 27 | 2 | 0.1% | |
| 35 | 1 | 0.1% | |
| 36 | 2 | 0.1% |
| Value | Count | Frequency (%) | |
| 522 | 1 | 0.1% | |
| 356 | 1 | 0.1% | |
| 310 | 1 | 0.1% | |
| 306 | 1 | 0.1% | |
| 298 | 1 | 0.1% |
_merge_prodprice
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.9 KiB |
| Value | Count | Frequency (%) | |
| both | 1318 | 74.0% | |
| left_only | 462 | 26.0% |
Frequencies of value counts
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Histogram of lengths of the category
Length
| Max length | 9 |
|---|---|
| Median length | 4 |
| Mean length | 5.297752809 |
| Min length | 4 |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
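The definition of r translates directly into a few lines of code. The following is a minimal sketch, not the routine used to build this report; `pearson_r` is a name chosen for illustration, and the sample values are the ten Albania rows of production_pigs_tonnes and slaughtered_pigs_hd shown in the first-rows table of this report.

```python
import math


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # The 1/(n - 1) factors of covariance and variances cancel, so raw sums suffice.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(ss_x * ss_y)


# Albania, 2011-2020, copied from the first-rows table of this report.
production = [13100, 13260, 13494, 11900, 11424, 11424, 11561, 10317, 10232, 9002]
slaughtered = [196000, 200000, 203530, 205368, 194029, 200597, 203002, 231680, 206967, 182046]

r = pearson_r(production, slaughtered)

# Location and positive-scale changes leave r unchanged.
r_rescaled = pearson_r([v / 1000 + 5 for v in production], slaughtered)
print(r, r_rescaled)
```

Ten rows from one country say little about the full dataset; the sample only illustrates the calculation.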
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
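Because ρ is Pearson's r applied to ranks, it can be sketched in two steps: rank each variable, then correlate the ranks. The helper names below are illustrative, and assigning tied values the average of their positions is an assumption, since the description above does not say how ties are handled.

```python
import math


def average_ranks(values):
    """Rank values from 1; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Covariance divided by the product of standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(ss_x * ss_y)


def spearman_rho(x, y):
    """Pearson's r computed on the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))


# Toy data: a monotonic but strongly nonlinear relationship.
x = [1, 2, 3, 4, 5, 6]
y = [v ** 3 for v in x]
print(pearson_r(x, y), spearman_rho(x, y))
```

On the toy data ρ reaches its maximum because the ranks agree perfectly, while r stays below 1 because the relationship is not linear.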
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
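The pair-counting description corresponds to the variant usually called τ-a. The sketch below implements exactly that formula; `kendall_tau_a` is an illustrative name. Pairs tied in either variable are counted as neither concordant nor discordant, which is an assumption here. Statistical libraries often report the tie-adjusted τ-b instead, so their values can differ when ties are present.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1   # both variables move the same way
        elif direction < 0:
            discordant += 1   # the variables move in opposite ways
        # direction == 0 means a tie; it counts toward neither total
    total_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / total_pairs


# Toy data: one swapped pair out of six gives 5 concordant and 1 discordant pair.
tau = kendall_tau_a([1, 2, 3, 4], [1, 3, 2, 4])
print(tau)
```

This direct version compares every pair, so its cost grows quadratically with the number of observations.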
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available for Phik.
First rows
| | country | year | yield_pigs_hgperhd | production_pigs_tonnes | slaughtered_pigs_hd | stocks_pigs_hd | country_inscope | producerprice_pigs_carcass_lcupertonne | producerprice_pigs_live_lcupertonne | producerprice_pigs_carcass_slcpertonne | producerprice_pigs_live_slcpertonne | producerprice_pigs_carcass_usdpertonne | producerprice_pigs_live_usdpertonne | producerprice_pigs_carcass_index | producerprice_pigs_live_index | _merge_prodprice |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Albania | 2011 | 668.0 | 13100.0 | 196000.0 | 163000.0 | False | 563000.0 | 297488.0 | 563000.0 | 297488.0 | 5580.0 | 2948.0 | 109.0 | 87.0 | both |
| 1 | Albania | 2012 | 663.0 | 13260.0 | 200000.0 | 158818.0 | False | 600000.0 | 297411.0 | 600000.0 | 297411.0 | 5546.0 | 2749.0 | 116.0 | 87.0 | both |
| 2 | Albania | 2013 | 663.0 | 13494.0 | 203530.0 | 152000.0 | False | NaN | 302000.0 | NaN | 302000.0 | NaN | 2858.0 | 86.0 | 89.0 | both |
| 3 | Albania | 2014 | 579.0 | 11900.0 | 205368.0 | 172455.0 | False | NaN | 329667.0 | NaN | 329667.0 | NaN | 3125.0 | 103.0 | 97.0 | both |
| 4 | Albania | 2015 | 589.0 | 11424.0 | 194029.0 | 171400.0 | False | NaN | 344754.0 | NaN | 344754.0 | NaN | 2737.0 | 98.0 | 101.0 | both |
| 5 | Albania | 2016 | 570.0 | 11424.0 | 200597.0 | 181024.0 | False | NaN | 346072.0 | NaN | 346072.0 | NaN | 2788.0 | 99.0 | 102.0 | both |
| 6 | Albania | 2017 | 570.0 | 11561.0 | 203002.0 | 180087.0 | False | NaN | 352731.0 | NaN | 352731.0 | NaN | 2962.0 | 101.0 | 104.0 | both |
| 7 | Albania | 2018 | 445.0 | 10317.0 | 231680.0 | 184133.0 | False | NaN | 359129.0 | NaN | 359129.0 | NaN | 3326.0 | 102.0 | 106.0 | both |
| 8 | Albania | 2019 | 494.0 | 10232.0 | 206967.0 | 183847.0 | False | NaN | 362279.0 | NaN | 362279.0 | NaN | 3298.0 | 97.0 | 107.0 | both |
| 9 | Albania | 2020 | 494.0 | 9002.0 | 182046.0 | 158401.0 | False | NaN | 385842.0 | NaN | 385842.0 | NaN | 3551.0 | NaN | NaN | both |
Last rows
| | country | year | yield_pigs_hgperhd | production_pigs_tonnes | slaughtered_pigs_hd | stocks_pigs_hd | country_inscope | producerprice_pigs_carcass_lcupertonne | producerprice_pigs_live_lcupertonne | producerprice_pigs_carcass_slcpertonne | producerprice_pigs_live_slcpertonne | producerprice_pigs_carcass_usdpertonne | producerprice_pigs_live_usdpertonne | producerprice_pigs_carcass_index | producerprice_pigs_live_index | _merge_prodprice |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1770 | Zimbabwe | 2011 | 551.0 | 19500.0 | 354000.0 | 396277.0 | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | left_only |
| 1771 | Zimbabwe | 2012 | 536.0 | 19400.0 | 362000.0 | 400000.0 | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | left_only |
| 1772 | Zimbabwe | 2013 | 551.0 | 20400.0 | 370000.0 | 415000.0 | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | left_only |
| 1773 | Zimbabwe | 2014 | 550.0 | 20800.0 | 378000.0 | 238145.0 | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | both |
| 1774 | Zimbabwe | 2015 | 549.0 | 21959.0 | 400000.0 | 345249.0 | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | both |
| 1775 | Zimbabwe | 2016 | 549.0 | 23091.0 | 420577.0 | 425540.0 | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | both |
| 1776 | Zimbabwe | 2017 | 549.0 | 11602.0 | 211228.0 | 242020.0 | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | both |
| 1777 | Zimbabwe | 2018 | 550.0 | 9562.0 | 174010.0 | 230424.0 | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | both |
| 1778 | Zimbabwe | 2019 | 550.0 | 10651.0 | 193820.0 | 279473.0 | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | both |
| 1779 | Zimbabwe | 2020 | 550.0 | 10107.0 | 183923.0 | 272206.0 | False | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | left_only |